Invoke URL checker.
Scan document to locate hypertext
links including a URL.
For each located hypertext link, do:
Determine context terms in predetermined
vicinity of located hypertext link.
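The "context terms in predetermined vicinity" step can be sketched as below. This is only an illustration: the window size, the stop-word list, and the function name `context_terms` are assumptions and do not come from the drawing.

```python
import re

# Hypothetical stop-word list; the drawing does not say how terms are filtered.
STOP_WORDS = {"the", "a", "an", "of", "on", "to", "is", "for", "and", "in", "at", "you", "i"}

def context_terms(text, link_start, link_end, window=10):
    """Return distinct non-stop-words found within `window` words of the link.

    `link_start` and `link_end` are character offsets of the hypertext link
    inside `text`; `window` stands in for the "predetermined vicinity".
    """
    before = re.findall(r"[A-Za-z]+", text[:link_start])[-window:]
    after = re.findall(r"[A-Za-z]+", text[link_end:])[:window]
    terms = []
    for word in before + after:
        lowered = word.lower()
        if lowered not in STOP_WORDS and lowered not in terms:
            terms.append(lowered)
    return terms
```

A caller would locate each link first, then pass its offsets to `context_terms` to obtain the terms later matched against the fetched page.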



Issue HTTP GET request to access
web page referenced in hypertext link.
Scan received web page to determine
instances of context terms in web page.



114



Does
web page include
sufficient number of instances of
context terms to satisfy qualifying
threshold?
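The decision above can be read as a simple count compared against a threshold. The sketch below assumes the threshold is a raw count of matching words; the drawing does not say whether it is a count, a ratio, or something else, and `satisfies_threshold` is a hypothetical name.

```python
import re

def satisfies_threshold(page_text, terms, threshold=3):
    """Return True if the page contains at least `threshold` instances of the context terms."""
    wanted = {t.lower() for t in terms}
    words = re.findall(r"[a-z]+", page_text.lower())
    hits = sum(1 for w in words if w in wanted)
    return hits >= threshold
```

Counting every instance (rather than distinct terms) follows the wording "number of instances"; counting distinct terms would be an equally plausible reading.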



No



FIG. 2



No



118 

Yes



Go to block 150 in FIG. 3
to seek correct URL.
Go to next hypertext link in document
until all hypertext links considered.
For each top level domain (TLD) different
from TLD in URL of hypertext link,
generate a modified URL using each
different TLD with domain name in URL.




Add each modified URL with different 
TLD to URL variation list. 
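A minimal sketch of the TLD substitution step follows. The candidate TLD list and the name `tld_variations` are assumptions for illustration; the drawing does not enumerate which TLDs are tried.

```python
from urllib.parse import urlsplit, urlunsplit

# Hypothetical candidate list; the drawing does not say which TLDs are considered.
CANDIDATE_TLDS = ["com", "org", "net", "gov", "edu"]

def tld_variations(url, tlds=CANDIDATE_TLDS):
    """Return modified URLs that keep the domain name but swap in each other TLD.

    Note: `hostname` is lower-cased and drops any port or user info,
    which is acceptable for this sketch.
    """
    parts = urlsplit(url)
    labels = (parts.hostname or "").split(".")
    if len(labels) < 2:
        return []
    current_tld = labels[-1]
    variations = []
    for tld in tlds:
        if tld != current_tld:
            host = ".".join(labels[:-1] + [tld])
            variations.append(
                urlunsplit((parts.scheme, host, parts.path, parts.query, parts.fragment))
            )
    return variations
```

Each returned URL would then be added to the URL variation list.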
Parse domain name as one word or compound
words and spell check the parsed term(s) to
generate possible correct spelling of the term and/or
compound terms.
Generate set of possible correct spellings of
domain name, which may include different
combinations of generated possible correct
spelling of compound terms.



Generate a modified URL for each 
possible correct spelling of domain 
name in the generated set. 



Append each modified URL generated with 
possible correct spelling to URL variation list. 
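The spell-check branch might look like the sketch below. It assumes the domain name has already been parsed into its compound terms, and it substitutes Python's `difflib.get_close_matches` for whatever spell checker the inventors had in mind. The word list and function names are hypothetical.

```python
import difflib
from itertools import product

# Tiny hypothetical dictionary; a real spell checker would use a full word list.
WORD_LIST = ["white", "house", "while", "horse", "home"]

def spelling_candidates(term, words=WORD_LIST, n=3, cutoff=0.75):
    """Return close dictionary matches for `term`, or the term itself if none."""
    matches = difflib.get_close_matches(term, words, n=n, cutoff=cutoff)
    return matches or [term]

def domain_spellings(terms, words=WORD_LIST):
    """Combine candidate spellings of each compound term into candidate domain names."""
    options = [spelling_candidates(t, words) for t in terms]
    original = "".join(terms)
    names = ("".join(combo) for combo in product(*options))
    # dict.fromkeys removes duplicates while keeping order.
    return [name for name in dict.fromkeys(names) if name != original]
```

Each name returned by `domain_spellings` would be placed back into the URL in place of the original domain name and appended to the URL variation list.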
Perform stemming algorithm on URL
domain name to determine
morphological variations of domain
name and the compound terms that
form the domain name.



Generate a modified URL for each 
determined morphological variation 
of domain name or combination of 
variations of compound terms that 
form the domain name. 



Append each modified URL 
generated with morphological 
variation of domain name or 
compound terms forming domain 
name to URL variation list. 
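The stemming branch could be approximated as follows. This is a deliberately crude suffix stripper, not the Porter algorithm or any stemmer named in the source, and the suffix list and function names are assumptions.

```python
from itertools import product

# Crude, hypothetical suffix list; a real stemmer handles far more cases.
SUFFIXES = ["ing", "ed", "es", "s"]

def stem(term):
    """Strip the first matching suffix, keeping at least three characters."""
    for suffix in SUFFIXES:
        if term.endswith(suffix) and len(term) - len(suffix) >= 3:
            return term[: -len(suffix)]
    return term

def morphological_variations(term):
    """Return the stem plus the stem with each suffix re-attached."""
    base = stem(term)
    forms = [base] + [base + suffix for suffix in SUFFIXES]
    return list(dict.fromkeys(forms))

def domain_variations(terms):
    """Combine variations of each compound term into candidate domain names."""
    options = [morphological_variations(t) for t in terms]
    original = "".join(terms)
    names = ("".join(combo) for combo in product(*options))
    return [name for name in dict.fromkeys(names) if name != original]
```

Naive re-attachment produces some non-words (for example "shoping"); in the flow shown here that is tolerable because every variation is later checked against the context terms before it is accepted.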



Go to block 200 
in FIG. 4. 



1^8 



FIG. 4 



P. R. Day et al.
ROC920000170US1 
Sheet 4/5 



For each modified URL i in
URL variation list, do:
Submit HTTP GET request
to modified URL i.
Scan received web page to
determine instances of
context terms in web page.
Does
web page include
sufficient number of
instances of context terms
to satisfy qualifying
threshold?



No 



No 



Yes 




Append modified URL i to
possible correct URL list.
Go back to block 200 until all URLs in
URL variation list considered.
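The loop over the URL variation list can be sketched as below. The fetch function is passed in so the logic can be exercised without network access; `http_get`, `find_possible_correct_urls`, and the count-based threshold are assumptions. Treating a failed request as one of the "No" branches is also a guess, since the drawing does not label what each branch tests.

```python
import re
import urllib.error
import urllib.request

def http_get(url, timeout=5):
    """Fetch a page with an HTTP GET; return its text, or None on failure."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as response:
            return response.read().decode("utf-8", errors="replace")
    except (urllib.error.URLError, ValueError, OSError):
        return None

def find_possible_correct_urls(variation_list, terms, fetch=http_get, threshold=3):
    """Return the modified URLs whose pages contain enough context-term instances."""
    wanted = {t.lower() for t in terms}
    possible = []
    for url in variation_list:
        page = fetch(url)
        if page is None:
            continue  # assumed "No" branch: no page received
        words = re.findall(r"[a-z]+", page.lower())
        hits = sum(1 for w in words if w in wanted)
        if hits >= threshold:
            possible.append(url)  # "Yes" branch
    return possible
```

In a test, `fetch` can be replaced by a dictionary lookup such as `pages.get`, which keeps the example deterministic.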



Go to block 116 in FIG. 2
for further hypertext links. 
The first president of the United States was George
Washington. President Washington, together with city planner
Pierre L'Enfant, chose the present location of the White House,
which is now 1600 Pennsylvania Avenue. The White House
has a long and fascinating history. For more information on the
White House, I suggest you go to [hypertext link illegible in source], which
is the official government web site of the White House. 302



